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Abstract 

The field of machine learning (ML) is concerned with the question of 
how to construct algorithms that automatically improve with experi- 
ence. In recent years many successful ML applications have been de- 
veloped, such as datamining programs, information-filtering systems, 
etc. Although ML algorithms allow the detection and extraction of 
interesting patterns of data for several kinds of problems, most of 
these algorithms are based on quantitative reasoning, as they rely on 
training data in order to infer so-called target functions. 

In the last years defeasible argumentation has proven to be a sound 
setting to formalize common-sense qualitative reasoning. This ap- 
proach can be combined with other inference techniques, such as those 
provided by machine learning theory. 

In this paper we outline different alternatives for combining de- 
feasible argumentation and machine learning techniques. We suggest 
how different aspects of a generic argument-based framework can be 
integrated with other ML-based approaches. 

1 Introduction and motivations 



The ability to generate and collect data has increased exponentially in the 
last years. Automatizing transactional databases has resulted in an explosive 



growth of information, motivating the evolution of several datamining tech- 
niques. Simply stated, datamining refers to extracting or mining knowledge 
from large amounts of data. This knowledge constitutes non-trivial and po- 
tentially useful information that can be obtained in many cases by applying 
machine learning (ML) techniques. 

The field of machine learning is concerned with the question of how to 
construct algorithms that automatically improve with experience |Mit97j . 
In recent years many successful ML applications have been developed, such 
as datamining programs, information-filtering systems, etc. Although ML 
algorithms allow the detection and extraction of interesting patterns of data 
for several kinds of problems, most of these algorithms provide an output 
based on quantitative evidence (i.e. training data), whereas the inference 
process which led to this output is commonly unknown (i.e. 'black-box' 
metaphor). 

In the last years defeasible argumentation |(MLnn[ IPV991 Wm\ has 
proven to be a sound setting to formalize common-sense qualitative reason- 
ing. This approach can be combined with other inference techniques, such 
as those provided by machine learning theory. 

In this preliminary report we explore different alternatives for developing 
applications which combine defeasible argumentation and machine learning 
techniques. We suggest how different aspects of a generic argument-based 
framework can be integrated with other ML-based approaches. The paper 
is structured as follows. First, we briefly introduce the components of most 
argument-based framework and then outline possible directions for the inte- 
gration of ML techniques and defeasible argumentation. Next, we describe a 
particular setting suitable for the application of such approach, namely text 
mining problems. Finally, we discuss some promising research lines that are 
currently being pursued. 

2 Integrating ML and argument-based frame- 
works 

Argument-based frameworks [ SL921 ICMLOO| IPV99j provide a sound formal- 
ization of defeasible reasoning, and have found a wide acceptance in many 
areas such as development of legal reasoning applications, multiagent sys- 
tems, etc. As pointed out in |CRL00j . most argument-based frameworks 



share a number of common notions, namely: 

1. Knowledge Base formalized in an underlying logical language: 

Most argument-based frameworks involve a knowledge base K = (11, A) 
which provides background knowledge for an intelligent agent formalized 
in a first-order logical language L. This background knowledge typically 
involves a set 11 of strict rules and facts and a set A of defeasible rules. 

2. Argument: An argument is a defeasible proof obtained from the 
knowledge base K by applying suitable (defeasible) inference rules as- 
sociated with the underlying logical language L. 

3. Dialectical reasoning: Given two arguments A and B, conflict (or 
attack) among arguments arises whenever A and B cannot be simul- 
taneously accepted (typically because of some kind of logical contra- 
diction). Many argument systems provide a preference criterion which 
defines a partial order among arguments, allowing to determine whether 
A should be preferred over B. This defines a defeat relationship. Given 
the set Args of arguments obtained from a knowledge base K, it holds 
that attacks C Args x Args, and defeats C attacks. In order to 
determine whether a given argument A is ultimately undefeated (or 
warranted), a dialectial process is recursively carried out, where de- 
featers for A, defeaters for these defeaters, and so on, are taken into 
account. 

Next we will outline different approaches that we are currently considering 
to model the above issues in the context of ML techniques. 

• Building a defeasible knowledge base from training data: 

Recently there have been some approaches to obtaining defeasible rules 
from training data, particularly in the context of Inductive Logic Pro- 
gramming (ILP). In |IK97j ■ the authors present a method to generate 
non-monotonic rules with exceptions from positive /negative examples 
and background knowledge in Inductive Logic Programming. The form 
of the programs to be learnt is the one of extended logic programs, which 
incorporates both strict negation and negation as failure that can be ef- 
fectively used in the presence of incomplete information. Default rules 
are generated as specializations of general rules that cover positive ex- 
amples, whereas exceptions to general rules are identified from negative 



examples and are then generalized to rules for cancellation of defaults. 
The resulting learning system LELP allows also to learn hierarchical 
defaults by recursively calling an exception identification algorithm. 

In |GS01j . the authors investigate the feasibility of Knowledge Dis- 
covery from Data-bases (KDD) in order to facilitate the discovery of 
defeasible rules for legal decision making. In particular they argue in 
favour of Defeasible Logic as an appropriate formal system in which 
the extracted principles should be encoded. 

In the context of obtaining defeasible rules by means of induction-based 
techniques, the work of Peter Flach |Fla98j provides an interesting sur- 
vey of several approaches to computational induction and provides a 
descriptive theory of induction as a logical framework system. Several 
rule systems for conjectural reasoning are axiomatized and semantically 
characterized. 

Building arguments: 

Finding hypotheses which explain unseen instances is central to most 
ML algorithms. Interestingly, in argument-based frameworks an argu- 
ment is also understood as providing "a hypothesis supporting a given 
conclusion" |SL92j . We contend that argument-based reasoning and 
many ML techniques share this common notion. Many theoretical re- 
sults from argument theory could therefore be applied in a ML context. 

Analytical learning methods |Mit97j (like explanation-based learning) 



offer the advantage of generalizing more accurately from less data by 
using prior knowledge to guide learning. However, they can be mis- 
led when given incorrect or insufficient prior knowledge. On the other 
hand, inductive learning methods (like neural nets, decision tree learn- 
ing, inductive logic programming) offer the advantage that they require 
no explicit prior knowledge and learn regularities based solely on the 
training data. However, they can fail when given insufficient training 
data, and can be misled by the implicit inductive bias they must adopt 
in order to generalize beyond the observed data. A combined analytical 
and inductive ML method that overcomes the pitfalls associated with 
each separate approach (yet conserving their individual advantages) 
should be as follows: given a set of training examples D of some target 
function / (possibly containing errors), a domain theory B (possibly 



containing errors), and a space of candidate hypothesis determines 
which hypothesis h fits best the training examples and domain theory. 

We think that defeasible argumentation can be used to improve ML ap- 
proaches as the one described above as it provides a sound formalization 
for both expressing and reasoning with uncertain and incomplete infor- 
mation. From the set of examples D and theory B, a knowledge base 
(n, A) can be induced as explained above. The hypothesis h can be 
expressed as an argument structure {A, h) where A stands for both the 
relevant part of the background theory and relevant features of D. 

• Dialectical reasoning and ML: 

The process of defeasible argumentation always involves the analysis 
of conflicting arguments in a dialectical setting. As explained before, 
such setting relies on a defeat relationship for comparing conflicting 
arguments. Determining whether an argument is ultimately accepted 
requires a recursive analysis in which defeaters, defeaters for these de- 
featers, and so on, are taken into account jCMLOOj . 

In many formalizations of argumentation the notion of defeat is con- 
sidered as an abstract relationship (i.e. a partial order ^ among argu- 
ments). Some approaches (e.g. |SL92j ) rely on the notion of specificity 
for defining defeat. In this context, ML techniques provide sound al- 
ternatives for considering numeric attributes or probabilistic values for 
deciding between conflicting hypotheses jMit97j . We think that such 
approaches could provide the basis for defining new comparison crite- 
ria in the context of argument-based frameworks. In the same line of 
reasoning, contrasting conflicting hypotheses is a common situation in 
many ML algorithms. In this setting defeasible argumentation provides 
a useful theoretical background for contrasting such hypotheses, mak- 
ing easier to identify fallacious patterns of reasoning which might lead 
to incorrect results. 

3 Text mining: a case study 

Datamining is the process of discovering interesting patterns from large 
amounts of data [ HKOOj . In textual datamining the needs of the user may 
vary from looking for a specific piece of text to getting familiarized with a 



subject area |Hon97j . Basic approaches for information retrieval and data 
mining involve the following strategies: 



• Searching using keywords: This is done by automatically indexing the 
documents by frequency of term appearance. This approach is used 
by web crawlers |BP00j and traditional information retrieval systems 
|FBY92] . 

• Exploration of the document collection supported by organizing the doc- 
uments in some manner: For each document, an internal representa- 
tion is obtained and then the documents are arranged into clusters by 
some similarity measure |Ras92j . This process can be done by using ML 
techniques such as neural networks |GL01at IGL01b| IHon97| IKKL+OO] 
or bayesian classifiers |BP97[ IPB97j . 

• Filtering: It refers to discarding uninteresting documents from an in- 
coming document flow [M MIP97] . 

Each approach has its pros and cons. Searching using keywords is easy to 
implement but can lead to retrieve unrelated documents or, even worse, to 
not discover related documents (this can be measured using some metric such 
as precision and recall |FB Y92j ) . Clustering of documents can be efficiently 



implemented by neural nets but may require retraining in the presence of new 
examples; besides, in this case the model is not understandable by a human 
programmer because it is compiled as a set of numerical weights. Filtering is 
difficult because of the dynamic nature of user interests and document flow 

Some recent approaches propose using argumentation to analyze struc- 
tured text in the form of XML documents [ BHOl| IHunOlj . As explained in 
the previous section, we feel that the integration of defeasible argumentation 
and ML can tackle many of the problems described above, thus enhancing 
existing algorithms for text mining. 



4 Conclusions and future work 

The success of argumentation-based approaches is partly due to the sound 
setting it provides for qualitative reasoning. Although numeric attributes 
offer an useful source of information for quantitative reasoning in several 



knowledge domains, they have been mostly neglected in the defeasible argu- 
mentation community. This is maybe due to the historical origins of argu- 
mentative reasoning, which were more related to legal (qualitative) reasoning 
rather than to number-based attributes as those used in rule-based produc- 
tion systems. 

We think that integrating ML techniques with argumentation frameworks 
would be highly desirable, as it would provide a combination of both ana- 
lytical and inductive ML methods capable of tackling the pitfalls of each 
separately yet conserving their advantages, making them more attractive 
and suitable for other research and application areas. Part of our current 
research work is focused on these aspects. 
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